The bivariate statistical analysis of environmental (compositional) data.

Authors

  • Peter Filzmoser
  • Karel Hron
  • Clemens Reimann
Abstract

Environmental sciences usually deal with compositional (closed) data. Whenever the concentration of chemical elements is measured, the data will be closed, i.e. the relevant information is contained in the ratios between the variables rather than in the data values reported for the variables. Data closure has severe consequences for statistical data analysis. Most classical statistical methods are based on the usual Euclidean geometry. Compositional data, however, do not plot into Euclidean space because they have their own geometry, which is not linear but curved in the Euclidean sense. This has severe consequences for bivariate statistical analysis: correlation coefficients computed in the traditional way are likely to be misleading, and the information contained in scatterplots must be used and interpreted differently than for non-compositional data. As a solution, the ilr transformation applied to a variable pair can be used to display the relationship and to compute a measure of stability. This paper discusses how this measure is related to the usual correlation coefficient and how it can be used and interpreted. Moreover, recommendations are provided for how the scatterplot can still be used, and which alternatives exist for displaying the relationship between two variables.
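The ilr transformation of a variable pair mentioned in the abstract can be illustrated with a minimal sketch. For two parts x and y, the standard ilr coordinate is z = ln(x / y) / sqrt(2), and its variance depends only on the ratio x / y, so it is unaffected by closure. The function names and the sample concentrations below are hypothetical, and the abstract does not state how the paper turns this variance into its stability measure, so that step is not reproduced here.

```python
import math
import statistics


def ilr_pair(x, y):
    """ilr coordinate of a two-part composition: z = ln(x / y) / sqrt(2)."""
    if len(x) != len(y):
        raise ValueError("x and y must have the same length")
    return [math.log(a / b) / math.sqrt(2) for a, b in zip(x, y)]


def ilr_variance(x, y):
    """Sample variance of the ilr coordinate for the pair (x, y)."""
    return statistics.variance(ilr_pair(x, y))


# Hypothetical concentrations of two elements (illustrative values only).
x = [12.0, 15.0, 9.0, 20.0, 11.0]
y = [30.0, 36.0, 25.0, 47.0, 28.0]

var_raw = ilr_variance(x, y)

# Close each sample to a sum of 1: the individual values change,
# but the ratio x / y in every sample stays the same.
totals = [a + b for a, b in zip(x, y)]
x_closed = [a / t for a, t in zip(x, totals)]
y_closed = [b / t for b, t in zip(y, totals)]

var_closed = ilr_variance(x_closed, y_closed)

print("variance of ilr coordinate, raw data:   ", var_raw)
print("variance of ilr coordinate, closed data:", var_closed)
```

The two printed variances agree up to floating-point rounding, which is the sense in which the ratio carries the relevant information. A small variance means the ratio between the two variables is nearly constant across samples.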


Similar resources

The Use of Robust Factor Analysis of Compositional Geochemical Data for the Recognition of the Target Area in Khusf 1:100000 Sheet, South Khorasan, Iran

The closed nature of geochemical data has been proven in many studies. Compositional data have special properties that mean that standard statistical methods cannot be used to analyse them. These data imply a particular geometry called Aitchison geometry in the simplex space. For analysis, the dataset must first be opened by the various transformations provided. One of the most popular of the a...


Investigating the Mediating Role of Combined Competitive Strategies and the Resource-Based Approach in the Effect of Structure on Organizational Performance

Today, companies operating in challenging conditions succeed when they acquire sufficient knowledge of environmental challenges and use it to improve their performance. In this regard, the present study investigates the effect of structure on organizational performance, taking into account the mediating role of competitive-compositional strategies an...


Stress-Strength and Ageing Intensity Analysis via a New Bivariate Negative Gompertz-Makeham Model

In demography and in modelling mortality (or failure) data, the univariate Makeham-Gompertz distribution is well known as an extension of the exponential distribution. Here, a bivariate class of Gompertz-Makeham distributions is constructed based on a random number of extremal events. Some reliability properties, such as ageing intensity and stress-strength based on competing risks, are given. Also dependence properties...


Transition Models for Analyzing Longitudinal Data with Bivariate Mixed Ordinal and Nominal Responses

In many longitudinal studies, mixed nominal and ordinal bivariate responses are measured. In these studies, the aim is to investigate the effects of explanatory variables on these time-related responses. A regression analysis for these types of data must allow for the correlation among responses over time. To analyze such ordinal-nominal responses, using a proposed weighting approach, an ...


Univariate statistical analysis of environmental (compositional) data: problems and possibilities.

For almost 30 years it has been known that compositional (closed) data have special geometrical properties. In environmental sciences, where the concentration of chemical elements in different sample materials is investigated, almost all datasets are compositional. In general, compositional data are parts of a whole which only give relative information. Data that sum up to a constant, e.g. 100 ...



Journal title:
  • The Science of the total environment

Volume 408, Issue 19

Pages: -

Publication date: 2010